Overview

Dataset statistics

Number of variables12
Number of observations1000
Missing cells192
Missing cells (%)1.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory93.9 KiB
Average record size in memory96.1 B

Variable types

NUM7
CAT5

Warnings

Title has a high cardinality: 999 distinct values High cardinality
Genre has a high cardinality: 207 distinct values High cardinality
Director has a high cardinality: 644 distinct values High cardinality
Actors has a high cardinality: 996 distinct values High cardinality
Revenue (Millions) has 128 (12.8%) missing values Missing
Metascore has 64 (6.4%) missing values Missing
Title is uniformly distributed Uniform
Director is uniformly distributed Uniform
Actors is uniformly distributed Uniform
Rank has unique values Unique
Description has unique values Unique

Reproduction

Analysis started2020-12-12 07:22:12.163834
Analysis finished2020-12-12 07:22:26.780226
Duration14.62 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

Rank
Real number (ℝ≥0)

UNIQUE

Distinct1000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean500.5
Minimum1
Maximum1000
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB
2020-12-12T12:52:26.972675image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile50.95
Q1250.75
median500.5
Q3750.25
95-th percentile950.05
Maximum1000
Range999
Interquartile range (IQR)499.5

Descriptive statistics

Standard deviation288.8194361
Coefficient of variation (CV)0.5770618104
Kurtosis-1.2
Mean500.5
Median Absolute Deviation (MAD)250
Skewness0
Sum500500
Variance83416.66667
MonotocityStrictly increasing
2020-12-12T12:52:27.231982image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
100010.1%
 
32910.1%
 
34210.1%
 
34110.1%
 
34010.1%
 
33910.1%
 
33810.1%
 
33710.1%
 
33610.1%
 
33510.1%
 
Other values (990)99099.0%
 
ValueCountFrequency (%) 
110.1%
 
210.1%
 
310.1%
 
410.1%
 
510.1%
 
ValueCountFrequency (%) 
100010.1%
 
99910.1%
 
99810.1%
 
99710.1%
 
99610.1%
 

Title
Categorical

HIGH CARDINALITY
UNIFORM

Distinct999
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
The Host
 
2
El secreto de sus ojos
 
1
Dark Shadows
 
1
The Mortal Instruments: City of Bones
 
1
The Help
 
1
Other values (994)
994 
ValueCountFrequency (%) 
The Host20.2%
 
El secreto de sus ojos10.1%
 
Dark Shadows10.1%
 
The Mortal Instruments: City of Bones10.1%
 
The Help10.1%
 
The Boy Next Door10.1%
 
Woman in Gold10.1%
 
Toy Story 310.1%
 
Miracles from Heaven10.1%
 
Chappie10.1%
 
Other values (989)98998.9%
 
2020-12-12T12:52:27.506248image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique998 ?
Unique (%)99.8%
2020-12-12T12:52:27.789735image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length61
Median length13
Mean length14.539
Min length2

Genre
Categorical

HIGH CARDINALITY

Distinct207
Distinct (%)20.7%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
Action,Adventure,Sci-Fi
 
50
Drama
 
48
Comedy,Drama,Romance
 
35
Comedy
 
32
Drama,Romance
 
31
Other values (202)
804 
ValueCountFrequency (%) 
Action,Adventure,Sci-Fi505.0%
 
Drama484.8%
 
Comedy,Drama,Romance353.5%
 
Comedy323.2%
 
Drama,Romance313.1%
 
Comedy,Drama272.7%
 
Action,Adventure,Fantasy272.7%
 
Animation,Adventure,Comedy272.7%
 
Comedy,Romance262.6%
 
Crime,Drama,Thriller242.4%
 
Other values (197)67367.3%
 
2020-12-12T12:52:28.107918image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique85 ?
Unique (%)8.5%
2020-12-12T12:52:28.356222image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length26
Median length20
Mean length18.095
Min length5

Description
Categorical

UNIQUE

Distinct1000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
A small town is taken over by an alien plague, turning residents into zombies and all forms of mutant monsters.
 
1
An uptight FBI Special Agent is paired with a foul-mouthed Boston cop to take down a ruthless drug lord.
 
1
Author P.L. Travers reflects on her childhood after reluctantly meeting with Walt Disney, who seeks to adapt her Mary Poppins books for the big screen.
 
1
When Dr. Jane Foster gets cursed with a powerful entity known as the Aether, Thor is heralded of the cosmic event known as the Convergence and the genocidal Dark Elves.
 
1
A growing nation of genetically evolved apes led by Caesar is threatened by a band of human survivors of the devastating virus unleashed a decade earlier.
 
1
Other values (995)
995 
ValueCountFrequency (%) 
A small town is taken over by an alien plague, turning residents into zombies and all forms of mutant monsters.10.1%
 
An uptight FBI Special Agent is paired with a foul-mouthed Boston cop to take down a ruthless drug lord.10.1%
 
Author P.L. Travers reflects on her childhood after reluctantly meeting with Walt Disney, who seeks to adapt her Mary Poppins books for the big screen.10.1%
 
When Dr. Jane Foster gets cursed with a powerful entity known as the Aether, Thor is heralded of the cosmic event known as the Convergence and the genocidal Dark Elves.10.1%
 
A growing nation of genetically evolved apes led by Caesar is threatened by a band of human survivors of the devastating virus unleashed a decade earlier.10.1%
 
A teenager is magically transported to China and learns to convert his video game skills into those of a Kung Fu warrior.10.1%
 
Romantic sparks occur between two dance students from different backgrounds at the Maryland School of the Arts.10.1%
 
70-year-old widower Ben Whittaker has discovered that retirement isn't all it's cracked up to be. Seizing an opportunity to get back in the game, he becomes a senior intern at an online fashion site, founded and run by Jules Ostin.10.1%
 
The story of a tourist family in Thailand caught in the destruction and chaotic aftermath of the 2004 Indian Ocean tsunami.10.1%
 
A retired CIA agent travels across Europe and relies on his old skills to save his estranged daughter, who has been kidnapped while on a trip to Paris.10.1%
 
Other values (990)99099.0%
 
2020-12-12T12:52:28.653461image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique1000 ?
Unique (%)100.0%
2020-12-12T12:52:28.925936image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length421
Median length159
Mean length163.232
Min length42

Director
Categorical

HIGH CARDINALITY
UNIFORM

Distinct644
Distinct (%)64.4%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
Ridley Scott
 
8
David Yates
 
6
Michael Bay
 
6
M. Night Shyamalan
 
6
Paul W.S. Anderson
 
6
Other values (639)
968 
ValueCountFrequency (%) 
Ridley Scott80.8%
 
David Yates60.6%
 
Michael Bay60.6%
 
M. Night Shyamalan60.6%
 
Paul W.S. Anderson60.6%
 
J.J. Abrams50.5%
 
Christopher Nolan50.5%
 
Peter Berg50.5%
 
Martin Scorsese50.5%
 
Zack Snyder50.5%
 
Other values (634)94394.3%
 
2020-12-12T12:52:29.207118image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique444 ?
Unique (%)44.4%
2020-12-12T12:52:29.441477image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length32
Median length13
Mean length13.139
Min length3

Actors
Categorical

HIGH CARDINALITY
UNIFORM

Distinct996
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
Daniel Radcliffe, Emma Watson, Rupert Grint, Michael Gambon
 
2
Jennifer Lawrence, Josh Hutcherson, Liam Hemsworth, Woody Harrelson
 
2
Gerard Butler, Aaron Eckhart, Morgan Freeman,Angela Bassett
 
2
Shia LaBeouf, Megan Fox, Josh Duhamel, Tyrese Gibson
 
2
Robert De Niro, Leslie Mann, Danny DeVito, Edie Falco
 
1
Other values (991)
991 
ValueCountFrequency (%) 
Daniel Radcliffe, Emma Watson, Rupert Grint, Michael Gambon20.2%
 
Jennifer Lawrence, Josh Hutcherson, Liam Hemsworth, Woody Harrelson20.2%
 
Gerard Butler, Aaron Eckhart, Morgan Freeman,Angela Bassett20.2%
 
Shia LaBeouf, Megan Fox, Josh Duhamel, Tyrese Gibson20.2%
 
Robert De Niro, Leslie Mann, Danny DeVito, Edie Falco10.1%
 
Kristen Stewart, Robert Pattinson, Taylor Lautner, Peter Facinelli10.1%
 
Ben Winchell, Josh Brener, Maria Bello, Andy Garcia10.1%
 
Gabriel Chavarria, Demián Bichir, Theo Rossi,Tony Revolori10.1%
 
Jim Carrey, Charlotte Gainsbourg, Marton Csokas, Kati Outinen10.1%
 
Mila Kunis, Justin Timberlake, Patricia Clarkson, Jenna Elfman10.1%
 
Other values (986)98698.6%
 
2020-12-12T12:52:29.675430image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique992 ?
Unique (%)99.2%
2020-12-12T12:52:29.919778image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length77
Median length58
Mean length58.288
Min length43

Year
Real number (ℝ≥0)

Distinct11
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2012.783
Minimum2006
Maximum2016
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB
2020-12-12T12:52:30.111233image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum2006
5-th percentile2007
Q12010
median2014
Q32016
95-th percentile2016
Maximum2016
Range10
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.205961508
Coefficient of variation (CV)0.00159280037
Kurtosis-0.8219639755
Mean2012.783
Median Absolute Deviation (MAD)2
Skewness-0.6898787091
Sum2012783
Variance10.27818919
MonotocityNot monotonic
2020-12-12T12:52:30.318677image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%) 
201629729.7%
 
201512712.7%
 
2014989.8%
 
2013919.1%
 
2012646.4%
 
2011636.3%
 
2010606.0%
 
2007535.3%
 
2008525.2%
 
2009515.1%
 
ValueCountFrequency (%) 
2006444.4%
 
2007535.3%
 
2008525.2%
 
2009515.1%
 
2010606.0%
 
ValueCountFrequency (%) 
201629729.7%
 
201512712.7%
 
2014989.8%
 
2013919.1%
 
2012646.4%
 

Runtime (Minutes)
Real number (ℝ≥0)

Distinct94
Distinct (%)9.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean113.172
Minimum66
Maximum191
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB
2020-12-12T12:52:30.546072image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum66
5-th percentile88
Q1100
median111
Q3123
95-th percentile150
Maximum191
Range125
Interquartile range (IQR)23

Descriptive statistics

Standard deviation18.81090817
Coefficient of variation (CV)0.1662152138
Kurtosis0.8583211032
Mean113.172
Median Absolute Deviation (MAD)12
Skewness0.8467127314
Sum113172
Variance353.8502663
MonotocityNot monotonic
2020-12-12T12:52:30.777219image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
108313.1%
 
100282.8%
 
117272.7%
 
110262.6%
 
106262.6%
 
118262.6%
 
102252.5%
 
112242.4%
 
104232.3%
 
123232.3%
 
Other values (84)74174.1%
 
ValueCountFrequency (%) 
6610.1%
 
7320.2%
 
8020.2%
 
8150.5%
 
8210.1%
 
ValueCountFrequency (%) 
19110.1%
 
18710.1%
 
18030.3%
 
17210.1%
 
17010.1%
 

Rating
Real number (ℝ≥0)

Distinct55
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.7232
Minimum1.9
Maximum9
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB
2020-12-12T12:52:31.035602image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1.9
5-th percentile5.1
Q16.2
median6.8
Q37.4
95-th percentile8.1
Maximum9
Range7.1
Interquartile range (IQR)1.2

Descriptive statistics

Standard deviation0.9454287893
Coefficient of variation (CV)0.1406218451
Kurtosis1.322270288
Mean6.7232
Median Absolute Deviation (MAD)0.6
Skewness-0.7431419408
Sum6723.2
Variance0.8938355956
MonotocityNot monotonic
2020-12-12T12:52:31.249283image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
7.1525.2%
 
6.7484.8%
 
7464.6%
 
6.3444.4%
 
6.6424.2%
 
7.2424.2%
 
7.3424.2%
 
6.5404.0%
 
7.8404.0%
 
6.2373.7%
 
Other values (45)56756.7%
 
ValueCountFrequency (%) 
1.910.1%
 
2.720.2%
 
3.210.1%
 
3.520.2%
 
3.720.2%
 
ValueCountFrequency (%) 
910.1%
 
8.820.2%
 
8.630.3%
 
8.560.6%
 
8.440.4%
 

Votes
Real number (ℝ≥0)

Distinct997
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean169808.255
Minimum61
Maximum1791916
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB
2020-12-12T12:52:31.719774image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum61
5-th percentile1260.35
Q136309
median110799
Q3239909.75
95-th percentile526551.85
Maximum1791916
Range1791855
Interquartile range (IQR)203600.75

Descriptive statistics

Standard deviation188762.6475
Coefficient of variation (CV)1.111622327
Kurtosis11.3126809
Mean169808.255
Median Absolute Deviation (MAD)88402
Skewness2.507918483
Sum169808255
Variance3.56313371e+10
MonotocityNot monotonic
2020-12-12T12:52:31.945171image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
142720.2%
 
9714120.2%
 
29120.2%
 
53111210.1%
 
70210.1%
 
4780410.1%
 
22661910.1%
 
7646910.1%
 
12569310.1%
 
17455310.1%
 
Other values (987)98798.7%
 
ValueCountFrequency (%) 
6110.1%
 
9610.1%
 
10210.1%
 
11510.1%
 
16410.1%
 
ValueCountFrequency (%) 
179191610.1%
 
158362510.1%
 
122264510.1%
 
104774710.1%
 
104558810.1%
 

Revenue (Millions)
Real number (ℝ≥0)

MISSING

Distinct814
Distinct (%)93.3%
Missing128
Missing (%)12.8%
Infinite0
Infinite (%)0.0%
Mean82.95637615
Minimum0
Maximum936.63
Zeros1
Zeros (%)0.1%
Memory size7.8 KiB
2020-12-12T12:52:32.188520image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.211
Q113.27
median47.985
Q3113.715
95-th percentile293.88
Maximum936.63
Range936.63
Interquartile range (IQR)100.445

Descriptive statistics

Standard deviation103.2535405
Coefficient of variation (CV)1.244672746
Kurtosis10.60763453
Mean82.95637615
Median Absolute Deviation (MAD)41.285
Skewness2.592515866
Sum72337.96
Variance10661.29362
MonotocityNot monotonic
2020-12-12T12:52:32.435859image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0.0370.7%
 
0.0150.5%
 
0.0440.4%
 
0.0240.4%
 
0.3240.4%
 
0.0540.4%
 
1.2930.3%
 
0.1530.3%
 
2.230.3%
 
0.5430.3%
 
Other values (804)83283.2%
 
(Missing)12812.8%
 
ValueCountFrequency (%) 
010.1%
 
0.0150.5%
 
0.0240.4%
 
0.0370.7%
 
0.0440.4%
 
ValueCountFrequency (%) 
936.6310.1%
 
760.5110.1%
 
652.1810.1%
 
623.2810.1%
 
533.3210.1%
 

Metascore
Real number (ℝ≥0)

MISSING

Distinct84
Distinct (%)9.0%
Missing64
Missing (%)6.4%
Infinite0
Infinite (%)0.0%
Mean58.98504274
Minimum11
Maximum100
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB
2020-12-12T12:52:32.670232image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum11
5-th percentile31
Q147
median59.5
Q372
95-th percentile85
Maximum100
Range89
Interquartile range (IQR)25

Descriptive statistics

Standard deviation17.19475702
Coefficient of variation (CV)0.2915104614
Kurtosis-0.6122051468
Mean58.98504274
Median Absolute Deviation (MAD)12.5
Skewness-0.1238873467
Sum55210
Variance295.6596691
MonotocityNot monotonic
2020-12-12T12:52:32.896663image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
66252.5%
 
72252.5%
 
68252.5%
 
64242.4%
 
57232.3%
 
51222.2%
 
65222.2%
 
48212.1%
 
81212.1%
 
76212.1%
 
Other values (74)70770.7%
 
(Missing)646.4%
 
ValueCountFrequency (%) 
1110.1%
 
1510.1%
 
1610.1%
 
1840.4%
 
1910.1%
 
ValueCountFrequency (%) 
10010.1%
 
9910.1%
 
9810.1%
 
9640.4%
 
9530.3%
 

Interactions

2020-12-12T12:52:12.830301image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:13.054667image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:13.313013image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:13.559378image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:13.817687image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:14.084938image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:14.308871image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:14.541904image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:14.772973image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:14.991703image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:15.238535image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:15.480257image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:15.719048image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:15.931452image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:16.148874image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:16.390261image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:16.615658image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:16.865952image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:17.186096image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:17.431471image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:17.644898image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:17.857336image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:18.076744image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:18.300148image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:18.530533image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:18.760915image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:19.045127image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:19.319395image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:19.593659image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:19.945719image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:20.305757image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:20.704304image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:21.078522image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:21.351730image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:21.610267image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:21.854923image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:22.055387image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:22.262800image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:22.543052image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:22.769131image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:23.003458image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:23.190877image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:23.409613image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:23.597033image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:23.812218image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:24.038612image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:24.288945image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:24.569163image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:24.783166image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Correlations

2020-12-12T12:52:33.090109image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
2020-12-12T12:52:33.378339image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
2020-12-12T12:52:33.735384image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
2020-12-12T12:52:34.020621image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

2020-12-12T12:52:25.222591image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:25.733025image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:26.408186image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-12-12T12:52:26.564765image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Sample

First rows

RankTitleGenreDescriptionDirectorActorsYearRuntime (Minutes)RatingVotesRevenue (Millions)Metascore
01Guardians of the GalaxyAction,Adventure,Sci-FiA group of intergalactic criminals are forced to work together to stop a fanatical warrior from taking control of the universe.James GunnChris Pratt, Vin Diesel, Bradley Cooper, Zoe Saldana20141218.1757074333.1376.0
12PrometheusAdventure,Mystery,Sci-FiFollowing clues to the origin of mankind, a team finds a structure on a distant moon, but they soon realize they are not alone.Ridley ScottNoomi Rapace, Logan Marshall-Green, Michael Fassbender, Charlize Theron20121247.0485820126.4665.0
23SplitHorror,ThrillerThree girls are kidnapped by a man with a diagnosed 23 distinct personalities. They must try to escape before the apparent emergence of a frightful new 24th.M. Night ShyamalanJames McAvoy, Anya Taylor-Joy, Haley Lu Richardson, Jessica Sula20161177.3157606138.1262.0
34SingAnimation,Comedy,FamilyIn a city of humanoid animals, a hustling theater impresario's attempt to save his theater with a singing competition becomes grander than he anticipates even as its finalists' find that their lives will never be the same.Christophe LourdeletMatthew McConaughey,Reese Witherspoon, Seth MacFarlane, Scarlett Johansson20161087.260545270.3259.0
45Suicide SquadAction,Adventure,FantasyA secret government agency recruits some of the most dangerous incarcerated super-villains to form a defensive task force. Their first mission: save the world from the apocalypse.David AyerWill Smith, Jared Leto, Margot Robbie, Viola Davis20161236.2393727325.0240.0
56The Great WallAction,Adventure,FantasyEuropean mercenaries searching for black powder become embroiled in the defense of the Great Wall of China against a horde of monstrous creatures.Yimou ZhangMatt Damon, Tian Jing, Willem Dafoe, Andy Lau20161036.15603645.1342.0
67La La LandComedy,Drama,MusicA jazz pianist falls for an aspiring actress in Los Angeles.Damien ChazelleRyan Gosling, Emma Stone, Rosemarie DeWitt, J.K. Simmons20161288.3258682151.0693.0
78MindhornComedyA has-been actor best known for playing the title character in the 1980s detective series "Mindhorn" must work with the police when a serial killer says that he will only speak with Detective Mindhorn, whom he believes to be a real person.Sean FoleyEssie Davis, Andrea Riseborough, Julian Barratt,Kenneth Branagh2016896.42490NaN71.0
89The Lost City of ZAction,Adventure,BiographyA true-life drama, centering on British explorer Col. Percival Fawcett, who disappeared while searching for a mysterious city in the Amazon in the 1920s.James GrayCharlie Hunnam, Robert Pattinson, Sienna Miller, Tom Holland20161417.171888.0178.0
910PassengersAdventure,Drama,RomanceA spacecraft traveling to a distant colony planet and transporting thousands of people has a malfunction in its sleep chambers. As a result, two passengers are awakened 90 years early.Morten TyldumJennifer Lawrence, Chris Pratt, Michael Sheen,Laurence Fishburne20161167.0192177100.0141.0

Last rows

RankTitleGenreDescriptionDirectorActorsYearRuntime (Minutes)RatingVotesRevenue (Millions)Metascore
990991Underworld: Rise of the LycansAction,Adventure,FantasyAn origins story centered on the centuries-old feud between the race of aristocratic vampires and their onetime slaves, the Lycans.Patrick TatopoulosRhona Mitra, Michael Sheen, Bill Nighy, Steven Mackintosh2009926.612970845.8044.0
991992Taare Zameen ParDrama,Family,MusicAn eight-year-old boy is thought to be a lazy trouble-maker, until the new art teacher has the patience and compassion to discover the real problem behind his struggles in school.Aamir KhanDarsheel Safary, Aamir Khan, Tanay Chheda, Sachet Engineer20071658.51026971.2042.0
992993Take Me Home TonightComedy,Drama,RomanceFour years after graduation, an awkward high school genius uses his sister's boyfriend's Labor Day party as the perfect opportunity to make his move on his high school crush.Michael DowseTopher Grace, Anna Faris, Dan Fogler, Teresa Palmer2011976.3454196.92NaN
993994Resident Evil: AfterlifeAction,Adventure,HorrorWhile still out to destroy the evil Umbrella Corporation, Alice joins a group of survivors living in a prison surrounded by the infected who also want to relocate to the mysterious but supposedly unharmed safe haven known only as Arcadia.Paul W.S. AndersonMilla Jovovich, Ali Larter, Wentworth Miller,Kim Coates2010975.914090060.1337.0
994995Project XComedy3 high school seniors throw a birthday party to make a name for themselves. As the night progresses, things spiral out of control as word of the party spreads.Nima NourizadehThomas Mann, Oliver Cooper, Jonathan Daniel Brown, Dax Flame2012886.716408854.7248.0
995996Secret in Their EyesCrime,Drama,MysteryA tight-knit team of rising investigators, along with their supervisor, is suddenly torn apart when they discover that one of their own teenage daughters has been brutally murdered.Billy RayChiwetel Ejiofor, Nicole Kidman, Julia Roberts, Dean Norris20151116.227585NaN45.0
996997Hostel: Part IIHorrorThree American college students studying abroad are lured to a Slovakian hostel, and discover the grim reality behind it.Eli RothLauren German, Heather Matarazzo, Bijou Phillips, Roger Bart2007945.57315217.5446.0
997998Step Up 2: The StreetsDrama,Music,RomanceRomantic sparks occur between two dance students from different backgrounds at the Maryland School of the Arts.Jon M. ChuRobert Hoffman, Briana Evigan, Cassie Ventura, Adam G. Sevani2008986.27069958.0150.0
998999Search PartyAdventure,ComedyA pair of friends embark on a mission to reunite their pal with the woman he was going to marry.Scot ArmstrongAdam Pally, T.J. Miller, Thomas Middleditch,Shannon Woodward2014935.64881NaN22.0
9991000Nine LivesComedy,Family,FantasyA stuffy businessman finds himself trapped inside the body of his family's cat.Barry SonnenfeldKevin Spacey, Jennifer Garner, Robbie Amell,Cheryl Hines2016875.31243519.6411.0